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Abstract: The phylogenetic relationship between the zerynthiids and the other groups of Papilionidae has 
long been a controversial issue. In this study, the complete nucleotide sequences of the mitochondrial 
genome ( mitogenome) of Sericinus montela belonging to the zerynthiids was determined using long PCR 
and conserved primer walking approaches. Meanwhile, the phylogenetic analysis of this species with other 
representative papilionid species was conducted to clarify their phylogenetic relationships on the 
mitogenomic level. The results showed that the entire mitochondrial DNA (mtDNA) molecule is 15 242 
bp in length with the content of A, T, G and C of 40.1% , 40.8% , 7.4% and 11.7%, respectively , 
and its nucleotide composition of the genome is highly A + T biased (80.8% ) ; all the 13 protein coding 
genes ( PCGs) use standard initiation codons ATN, and all the PCGs use common stop codon (TAA) , 
except for the ND4 and ND4L genes, which terminate with a single T; all tRNA genes form a typical 
clover-leaf secondary structure, except for the tRNA™ ( AGN), whose DHU arm forms a simple loop; 
there are twelve intergenic spacer regions ranging from 2 to 65 bp in size, and fifteen overlaps ranging 
from 1 to 8 bp in size in the mitogenome sequence. The neighbor joining and maximum parsimony 
phylogenetic analyses based on the 13 PCG sequences showed that Sericins montela and Luehdorfia 
chinensis form a clade that is sister to the Parnassius bremeri, suggesting that they should be designated as 
a taxon of tribal level within the subfamily Parnassiinae in the family of Papilionidae. 
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1 INTRODUCTION 


Mitochondrial genomes of metazoan animals 
consist of a circular molecular (15 —20 kb) , which 
It has a 
remarkably conserved set of 37 genes: 13 PCGs 
(coxl — 3, atp6, atp8, nadxl - 6, nad4L and 
cytb), 22 transfer RNA genes (tRNAs), 2 
ribosomal RNA genes ( rRNAs) , and a non-coding 


is separated from the nuclear genome. 


control element regulating the transcription and 
1999; 
Taanman, 1999). Although gene content of animal 


replication of the mitogenome ( Boore, 
mitogenome is well conserved, evidence has been 
provided that mitogenomes with gene content of more 
than 37, are mainly due to the emergence of tRNA 
gene extra copies. For example, the lepidopteran 


Coreana raphaelis has an extra copy of tRNA™ 
(AGN) (Kim et al., 2006) , Acraea issoria has an 
extra copy of tRNA'*( AUR) (Hu et al., 2010) , the 
dipteran Chrysomya chloropyga has an extra tRNA 
(Junqueira et al., 2004), and some species of 
thysanopterans and hemipterans contain variable 
number of tRNA genes (Shao and Barker, 2003; 
Thao et al., 2004). 

The phylogeny and scientific classification of the 
family Papilionidae has long been a controversial and 
complicated issue (Su et al., 2007). It has become 
mtDNA for the 


phylogenetics and population genetics on the genomic 


popular to use studies of 
level, with the development of PCR and applicability 
of universal primers ( Kocher et al., 1989; 
Yamauchi et al., 2004 ). So far, only seven 
complete or nearly complete mitochondrial DNA 
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sequences of Papilionidae have been reported or 
determined: Papilio xuthus (Feng et al., 2010), 
Parnassius bremeri ( Kim et al., 2009), Papilio 
Luehdorfia Troides 
Lamproptera curius and Teinopalpus aureus, and 
among these species, 


maraho , chinensis , Geacus , 
the later five were only 
available directly from our determination ( Troides 
aeacus and Lamproptera curius, unpublished ) or 
download from the GenBank ( the GenBank accession 
numbers: Papilio maraho: NC. 014055, Luehdorfia 
chinensis: EU622524, Teinopalpus aureus: NC _ 
014398 ). 


papilionid butterfly species are needed to clarify this 


And thus, more mitogenomes of the 


problem with more details. 

The dragon swallowtail, Sericins montela Gray, 
is a representative species of the tribe Zerynthiinii, 
subfamily Parnassiinae | sometimes subfamily 
Zerynthiinae by some scholars such as Chou (1998) 
and Li and Zhu (1992) | in the family Papilionidae. 
It is commonly found in Russian, Far East, Korea, 


China and Japan ( Ackery, 1975) and may be the 


only multivoltine species in the Parnassiinae 
(Igarashi, 2003). In this paper, we report the 
complete mitochondrial genome of Sericins montela , 
providing the sequence information for the studies of 


butterfly mitogenomic evolution and phylogenetics. 


2 MATERIALS AND METHODS 


2.1 Specimen collection 
Adult individuals of S. montela were collected 


55 


at the Slender West Lake ( Yangzhou, Jiangsu 
Province, China) in September, 2008. The fresh 
materials were preserved in 100% alcohol and stored 
at —20'C until used for DNA extraction. 
2.2 PCR amplification and sequencing 

Total genomic DNA was extracted from thoracic 
muscle tissue using glass bead method after Hao et 
al. (2005). The entire genome of S. montela was 
amplified using standard PCR method and a set of 
mitochondrial universal primers ( Simon et al., 
1994; Caterino and Sperling, 1999; Simmons and 
Weller, 2001 ). Universal primers were used to 
amplify CO I, CO IL, Cytb and IrRNA (16S) 
genes. The primers for the short fragment ( NDS ) 
and the five long fragments | CO Į -CO II, CO I- 
NDS, ND5-cytb, Cytb-srRNA (12S), 12S-CO I | 
were designed by the multiple sequence alignments 
of the 


lepidopterans 


mitogenomes of the other 
ClustalX1. 8 
software ( Thompson et al., 1997 ) and Primer 


complete 
available, using the 
Premier 5.0 software. The long PCR amplification 
was performed using TaKaRa LA Taq polymerase 
with the following cycling parameters; 95°C for 5 
min; 30 cycles of 95°C for 50 s, 50 for 50 s, 
68°C for 2 min and 30 s; and a final extension step 
of 68°C for 10 min. The PCR products were detected 
via electrophoresis on 1. 2% agarose gel, purified 
using the 3S Spin PCR Product Purification Kit and 
sequenced directly with ABI-377 automatic DNA 
sequencer. All the sequences of the primers for each 


long PCR are listed in Table 1. 


Table 1 Primers for the long PCR amplification 
and their sequences used in this study 


Primers Upper primer sequence (5' -3’) 
COI-COII GATTGTTACCACCTTCTT 

CO II -NDS5 TTTTTGGACAATGTTCAG 
NDS-Cytb GGGAATACCACACAAAGC 
Cytb-128 ATTATTCCTCCATCAAACAG 
128-CO I AATCTGCATCTTGATCT 


2.3 Sequnce analysis and annotation 

The raw sequence was run in the NCBI BLAST 
program for the identification and rectification. of 
after the 
concatenation, the complete nucleotide sequences of 


sequence homology, and sequence 
the S. montela mitogenome was obtained. PCGs and 
rRNA genes were identified using the ClustalX1. 8 
software and the NCBI Internet BLAST search 
function. The tRNA gene analysis was conducted 


using tRNAscan-SE software v. 1. 21 ( http:// 
lowelab. ucsc. edu/tRNA Scan-SE ) the putative 


Lower primer sequence (5' -3’) 
ATCTATGATTTGCTCCAC 
TAGCTITGTGTGGTATTC 
CTGTTTGATGGAGGAATAAT 
TTAATTCAGATCAAGATGC 
ATTAGAAGAAAGAGGGG 


tRNAs not found by tRNAscan-SE were confirmed by 
sequence comparison with other lepidopterans. The 
nucleotide composition and codon usage were 
calculated in MEGA 4.0 software (Tamura et al., 
2007). The mitogenome sequence data have been 
deposited into the GenBank database under the 
accession number HQ259122. 
2.4 Sequence variation and phylogenetic analysis 
The multiple alignments of the concatenated 
amino acid sequences of the 13 PCGs were done by 
the ClustalX software. The genetic distances of S. 
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montela and the 7 representative papilionid butterfly 
species as mentioned above in the introduction were 
calculated, meanwhile, the neighbor joining ( NJ) 
(Saitou and Nei, 1987) and maximum parsimony 
(MP) (Fitch, 1971) phylogenetic trees of the 
papilionid species were reconstructed based on the 
PCG sequences using the JTT evolutionary model 
( Jones et al., 1992) by the MEGA 4.0 software. In 
the phylogenetic analysis, a nymphalid species 
Argreus hyperbius was selected as the outgroup, and 
the bootstrap values with internal branch tests were 
obtained by 1 000 replicates to estimate the support 
levels for the nodes in the resultant topologies of both 


NJ and MP trees. 


3 RESULTS 


Mitogenome of S. montela is 15 242 bp in 
length, slightly longer than that of Artogeia melete 
(15 140 bp) (GenBank accession no. NC. 010568) 
(Hong et al., 2009) , but much shorter than that of 
Papilio maraho (16 094 bp) ( GenBank accession 
no. FJ810212). It contains 37 genes that are found 
in typical metazoan animals; 13 PCGs, 2 rRNAs and 
22 tRNAs. The gene arrangement of the S. montela 
mitogenome (Fig. 1) was similar to that found in the 
inferred ancestral insect ( Boore, 2001). Nucleotide 
compositions of A (40.1%), T (40. 8%), G 
(7.4% ), C (11. 7%) are highly A + T biased 
(80. 896) , which corresponds well to the AT bias 
generally observed in other insect mitochondrial 
genomes, ranging from 69.5% to 84.9% (Crozier 
and Crozier, 1993; Dotson and Beard, 2001 ). 


Sericinus montela 
Mitochondrial DNA 


15 242 bp 





Fig. 1 Circular map of the mitogenome of Sericunus montela 
The abbreviations for the genes are as follows: CO I , CO I and CO II 
refer to the cytochrome oxidase subunits, CytB refers to cytochrome b, and 
ND1 -6 refer to NADH dehydrogenase components. tRNAs are denoted as 
one-letter symbol according to the IUPAC-IUB single letter amino acid 
codes. Gene name without underline indicates the direction of transcription 
from left to right and that with underline indicates night to left. 


3.1  Protein-coding genes 

The sequence of the 13 PCGs is 11 144 bp in 
length. Analysis of base composition at each codon 
position of the 13 PCGs of S. montela showed that 
the A + T content at the 3rd codon position 
(83.496) was higher than at the first (7796 ) and 
the second (80.2% ) codon position. 

Table 2 shows all of the start and stop codons of 
the 13 PCGs. All 13 PCGs of S. 
mitogenome use standard ATN as start codons 
(ND2, ND3, NDS, ND4, ND6, ATPase8 and 
CO | genes use ATT; ND1 uses ATA; ATPase6, 
COII, COM, ND4L and Cytb use ATG). For the 
stop codons, the typical TAA was used for the 
termination of the most putative protein sequences 
(Table 2) , whereas, for the ND4 and ND4L genes, 
they terminated with a single T, which is adjacent 
directly to tRNA™ downstream and to tRNA™ 
upstream, respectively. 

The number of codons (CDs) for the 13 PCGs 
is 3 691 in total, and all the codon usage of PCGs is 
listed in Table 3. Some codons have much more 
numbers and occur with relatively higher 
frequencies, such as the ATT(I), TTT(F), AAT 
(N) and AAA(K) are 288, 275, 274 and 265 in 
number, accounting for 7.8% , 7.45% , 7.4% and 
7.2% of all the codons, respectively. In addition, 
some synonymous codons occur randomly in the 


montela 


genome, and thus the RSCU (relative synonymous 
codon usage) is often used to detect this bias ( Table 
3). These codons and codon families exhibit a very 
similar behavior with those of other lepidopteran 
species ( Salvato et al., 2008). 

3.2 Transfer RNAs and ribosomal RNAs 

Like all other metazoan mtDNAs sequenced, S. 
montela mtDNA contains 22 tRNAs, which range 
from 62 to 71 bp in size. Twenty-one tRNA 
sequences can be folded into perfect cloverleaf 
secondary structure, with the exception of tRNA™ 
(AGN) gene, which does not form a stable stem 
loop structure in the DHU arm (Fig. 2). This case 
was seen in many other metazoan, including insect 
mtDNA tRNA?" (AGN) (Wolstenholme, 1992). 
Additionally, total 8 bp overlaps exist in all the 22 
tRNA sequences, the longest (8 bp) is located 
betwen tRNA™ and tRNA“ as reported in a dipteran 
species Cochliomyia hominivorax ( Lessinger et al., 
2000). 

The amino acid stems of all 22 tRNAs are 
invariant, all of them are 7 nt. Variability occurs at 
the DHU loops (3 —11 nt) and TWC loops (2 -8 
nt) (Fig. 2). 

The 1 338-bp IrRNA (16S) and 760-bp srRNA 
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Table 2 Organization of the Sericunus montela mitochondrial genome 








Size 


Intergenic 


Gene Direction Nucleotide position (bp) nucleotides Anticodon Start codon Stop codon 
tRNA™* F 1 -69 69 CAT 
tRNA” F 70 -134 65 0 GAT 
tRNACh R 132 - 200 69 -3 TTG 
ND2 F 266 -1 279 1 014 65 ATT TAA 
tRNA? F 1 278 -1 342 65 -2 TCA 
tRNA“ R 1 335 - 1 398 64 -8 GCA 
tRNA™ R 1 401 -1 466 66 2 GTA 
COI F 1 464 —3 005 1 542 -3 ATT TAA 
tRNATe™ UUR) F 3 001 -3 067 67 -5 TAA 
COII F 3 068 -3 724 657 0 ATG TAA 
tRNA F 3 749 -3 819 71 24 CTT 
tRNAA® F 3 819 -3 884 66 -1 GTC 
ATPS F 3 885 -4 052 168 1 ATT TAA 
ATP6 F 4 046 -4 723 678 -7 ATG TAA 
COIII F 4723 -5 511 789 -1 ATG TAA 
tRNA®Y F 5515-5 579 65 3 TCC 
ND3 F 5 579 -5 933 362 -1 ATT TAG 
tRNA“? F 5 932 -5 996 65 -2 TGC 
tRNAA F 5 996 -6 059 63 -1 TCG 
tRNAA™ F 6 060 -6 125 66 0 GTT 
tRNASer(ACN) F 6 129 -6 190 62 4 GCT 
tRNAC F 6 191 -6 261 71 0 TTC 
tRNA? R 6 260 -6 323 64 -2 GAA 
NDS R 6 323 -8 038 1 716 -1 ATT TAA 
tRNA R 8 054 -8 119 66 15 GTG 
ND4 R 8 117 -9 449 1 333 -3 ATT T--tRNA 
ND4L R 9 459 -9 733 274 9 ATG TA--- 
tRNA™ F 9 747 -9 811 65 13 TGT 
tRNA?” R 9 812 -9 877 66 0 TGG 
ND6 F 9 880 — 10 410 5 231 2 ATT TAA 
Cytb F 10 407 - 11 570 1 164 -4 ATG TAA 
tRNASer(UCN) F 11 577 -11 644 68 6 TGA 
ND1 R 11 663 - 12 598 9 536 18 ATA TAA 
tRNATex(CUY R 12 603 —12 672 70 4 TAG 
IrRNA R 12 673 - 14 010 1 338 0 
tRNA R 14 011 -14 074 64 0 TAC 
srRNA R 14 075 - 14 833 760 0 
D-loop 14 834 —15 241 408 


The tRNA abbreviations follow the IU-PAC-IUB three-letter code. For other abbreviations see Fig. 1 legend. 
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Table 3 Codon usage of PCGs in the Sericunus montela mitogenome 











Codon (aa) N (RSCU) Codon (aa) N(RSCU) 
UUU(F) 275.0(1.54) UCU(S) 53.0(2.15) 
UUC(F) 83.0(0. 46) UCC(S) 22.0(0.89) 
UUA(L) 192.0(2.32) UCA(S) 47.0(1.91) 
UUG(L) 63.0(0.76) UCG(S) 9.0(0.37) 
CUU(L) 105.0(1.27) CCU(P) 47.0(1.72) 
CUC(L) 31.0(0.38) CCC(P) 26.0(0.95) 
CUA(L) 78.0(0.94) CCA(P) 32.0(1.17) 
CUG(L) 27.0(0.33) CCG(P) 4. 0(0. 15) 
AUU(1) 288.0(1.57) ACU(T) 47.0(1.68) 
AUC(1) 78.0(0.43) ACC(T) 24.0(0. 86) 
AUA(M) 240.0(1.73) ACA(T) 31.0(1.11) 
AUG(M) 37.0(0.27) ACG(T) 10. 0(0. 36) 
GUU( V) 26.0(1.63) GCU(A) 26.0(2.42) 
GUC( V) 6.0(0.38) GCC( A) 4.0(0.37) 
GUA(V) 26.0(1.63) GCA( A) 12.0(1.12) 
GUG(V) 6.0(0.38) GCG( A) 1.0(0.09) 





Codon (aa) N(RSCU) Codon (aa) N(RSCU) 
UAU( Y) 187.0(1.65) UGU( C) 16.0(1.33) 
UAC(Y) 39.0(0.35) UGC(C) 8.0(0.67) 
UAA(*) 259.0(1.61) UGA(W) 38.0(1.58) 
UAG( * ) 62.0(0.39) UGG(W) 10. 0(0. 42) 
CAU(H) 64.0(1.64) CGU(R) 6.0(1.26) 
CAC(H) 14.0(0.36) CGC(R) 2.0(0.42) 
CAA(Q) 68.0(1.36) CGA(R) 10.0(2. 11) 
CAG(Q) 32.0(0. 64) CGG(R) 1.0(0.21) 
AAU(N) 274.0(1.80) AGU(S) 12. 0(0. 49) 
AAC(N) 31.0(0.20) AGC(S) 12. 0(0. 49) 
AAA(K) 265.0(1.83) AGA(S) 26.0(1.06) 
AAG(K) 24.0(0.17) AGG(S) 16.0(0.65) 
GAU(D) 78.0(1.61) GGU(G) 12. 0(1.50) 
GAC(D) 19.0(0.39) GGC(G) 3.0(0.38) 
GAA(E) 91.0(1.40) GGA(G) 14.0(1. 75) 
GAG(E) 39. 0(0. 60) GGG(G) 3.0(0.38) 


A total of 3 691 codons are analyzed, excluding the initiation and termination codons. N; Number of each codon. RSCU: Relative synonymous codon 


usage. " Stop codon. 


genes are located between tRNA” and tRNA, and 
between tRNA“ and A + T-rich region, respectively 
(Table 2). They are well within the size of other 
insects, such as Parnassius bremeri (1 344 bp and 773 
bp), and Papilio maraho (1 332 bp and 778 bp). 
The AT contents of both rRNAs are 83. 696 and 
84.6% , respectively. 
3.3 Non-coding regions 

For $. montela in this study, the A + T content 
of AT-rich region is up to 94.1% and spans 408 bp 
in length, which is a little longer than that of 
Coreana raphaelis (375 bp) , but much shorter than 
Papilio maraho (1 269 bp) among the known 
butterfly species. The S. montela AT-nch region 
does not contain any conspicuous macro-repeat 
units, which are commonly found in other insect 
species but absent from all the lepidopteran species 
sequenced to date ( Zhang and Hewitt, 1997; Hu et 
al., 2010; Liao et al., 2010). However, it includes 
(TA), and ( AT), microsatellite-like regions, and 
the former is preceded by a conserved motif ATTTA. 
The second poly-T region is close to the upstream of 
the tRNA", spanning 8 bp long. 

Besides the AT-rich region, there are another 
13 non-coding regions (intergenic spacer) in the 
complete nucleotide sequences of S. montela. They 
are 1 —65 bp in size (five of them are over 10 bp): 
the longest (65 bp) (spacer 1) is located between 


tRNA^"and ND2 genes, and this case is usually 
observed in many other lepidopteran insects ( Lee et 
al., 2006; Hu et al., 2010), but is not found in 
The 24 bp long 


intergenic spacer (spacer 2) existed between COII 


other insect species till now. 


gene and tRNA™ gene, and this phenomenon was not 
found in any other lepidopteran species, which 
maybe derived from the duplication of one of these 
tRNAs ( Lavrov et al., 2002). The spacer of 18 bp 
(spacer 3) locating between tRNA gene and ND1 
gene is similar to most insect mitochondrial genomes, 
varying remarkably in size among lepidopteran 
species. Another two spacers, which are larger than 
10 bp in size are located between NDS and tRNA"* 
(15 bp) and between ND4L and tRNA (13 bp), 
respectively. Additionally, there is a 7 bp motif 
( ATACTAA) located between the ND1 gene and 
tRNA?" vene. In addition, there are total 
fifteen overlapping sequences ranging from 1 bp to 8 
bp in size scattered in the whole mitogenome, and 
among which, the longest overlap (8 bp) is located 
between the tRNA" gene and tRNA“ gene ( Table 
2). 
3.4  Phylogenetic analyses 

In this study, corresponding sequence variation 
and phylogenetic analyses were conducted to further 
clarify the phylogenetic relationships among the main 
lineages of papilionid butterflies as done in dipterans 
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Fig. 2 Predicated secondary clover-leaf structure for the 22 tRNAs of Sericunus montela 
The tRNAs are labelled with the abbreviations of their corresponding amino acids. Nucleotide sequences from 5’ to 3' as indicated for tRNA“". Dash ( -) 


indicates Watson-Crick basepairing, and centred dot ( * ) indicates G-U base-pairing. Arms of tRNAs ( clockwise from top) are the amino acid acceptor 


(AA) arm, TWC (T) arm, the anticodon (AC) arm, and dihydrouridine ( DHU or D) arm. 
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by Cameron et al. (2007). The 13 concatenated PCG 
sequences of the 8 papilionid species were multiple 
aligned using the ClustalX1.8 software, and the results 
show that the aligned sequences covers 10 360 
nucleotides in total, including 6 480 conservative, 
3 871 variable and 2 148 parsimony-informative sites, 
respectively. The results of phylogenetic analysis show 
that the resultant NJ and MP trees share the same 
topology , in which one branch is the grouping of the S. 
montela , Luehdorfia chinensis and Parnassius bremeri , 
and Lamproptera curius with the relations of (((S. 


A 76 Sericinus montela 
99 | 一 一 ww chinensis 
70 Parnassius bremeri 


99 Lamproptera curius 
Teinopalpus aureus 
99 Papilio xuthus 
Papilio maraho 
Troides aeacus 


Argreus hyperbtus 


montela , Luehdorfia chinensis) , Parnassius bremeri ) , 
Lamproptera curius ); the second branch is the 
clustering of Papilio xuthus, Papilio maraho and 
Teinopalpus aureus, and the two Palilio species is sister 
to Teinopalpus aureus; the third branch is the Troides 
aeacus , which is sister to the grouping of the above two 
branches ( Fig. 3). These results are remarkably 
congruent with those obtained by the combined nuclear 
and mitochondrial genes analyses by Caterino et al. 


(2001). 


B 70 Sericinus montela 
84 — chinensis 
76 Parnassius bremeri 


96 Lamproptera curius 
Teinopalpus aureus 
100 Papilio xuthus 
Papilio maraho 
Troides aeacus 


Argreus hyperbtus 


Fig. 3 NJ (A) and MP (B) phylogenetic trees based on the mitogenome PCG amino acid sequences with the JTT model 


Numbers above branches indicate bootstrapping values with 1 000 replications. 


4 DISCUSSION 


4.1  Protein-coding genes 

The facts of the PCG AT bias and the higher AT 
content of the third codon position compared with the 
first and second may be correlated with the lower 
pressure of the third codon site, and this case 
corresponds with the GC—AT trends of substitutions 
in the animal mitochondrial genes ( Ye et al., 
2008). 

The start codon for CO | of S. montela was 
standard ATT, 
codons commonly found in other lepidopteran species 
examined to date ( Fenn et al., 2007, Liao et al., 
2010) , such as in the cases of Adoxophyes orana, 


other than those non-canonical 


the codon CGA, typically special for Arginine was 
tentatively assumed as the CO [ start codon ( Lee et 
al., 2006 ), 
tetranucleotide TTAG assumed as the initiation site 
for CO I (Kim et al., 2006) ; in addition, Clary 
and Wolstenholme (1985) and de Bruijn (1983) 
also considered the start codon of CO [| were 
tetranucleotides ATAA, TTAA or ATTA in some 
insect species. Moreover, some hexanucleotides, 
such as TATTAG in Ostrinia furnicalis and Ostrinia 
nubilalis ( Coates et al., 2005 ), ATTTAA in 
Anopheles gambiae ( Beard et al., 1993) , TATCTA 
in Penaeus monodon ( Wilson et al., 2000) , and 


and Coreana raphaelis, the 


Anopheles quadrimaculatus ( Mitchell et al., 1993) , 
were recognized as the start codon for CO [ , etc. 
Thus, the issue about the CO [ start codon is still 
far from reaching consensus, and more studies for 
mRNA transcripts would be needed to clarify the 
prediction on the CO [| initiation position. 

The number and frequency of codon usage of S. 
montela are similar with those reported in all other 
insect species (De Brujin, 1983; Wilson et al., 
2000; Coates et al., 2005; Kim et al., 2006). 

4.2 Transfer RNAs and ribosomal RNAs 

In general, the sizes and structures of 22 tRNAs 
and 2 rRNAs, as well as the AT contents of both 
tRNAs and rRNAs are also well within the range of 
the other sequenced insects ( Beard et al., 1993; 
Kim et al., 2006, 2009; Fenn et al., 2007; Feng et 
al., 2010; Hu et al., 2010). 

4.3 Non-coding regions 

The AT-rich region known for the initiation of 
replication in vertebrates and invertebrate animals is 
located between the srRNA gene and tRNA“ gene 
(Fig. 1), and this mitochondrial control region is 
considerably more conservative in structure than most 
other insect groups in that it usually is consisted of 
five readily identifiable regions; the putative origin 
of minority or light strand replication ( ON ) ; the 
microsatellite containing region; a polythymidine 
(poly-T) stretch; a highly variable region (150 - 
375 bp in size); and a second poly-T stretch 
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immediately upstream of tRNA"* ( Cameron and 
Whiting, 2008). 

To our knowledge, the absence of the 
macrosatellilite-like units in S. montela AT-rich 
region are commonly found in other insect species 
but absent from all the Species 
sequenced to date ( Zhang and Hweitt, 1997; 
Cameron and Whiting, 2008; Hu et al., 2010; Liao 
et al., 2010); however, the microsatellite-like 
regions are presented as distinguishing features, such 
as the ( TATTA),, between tRNA“ and tRNA™ in 
Adoxophyes orana ( Lee et al., 2006), ( TA), and 
( TA),, between ND3 and tRNA^" in Bombyx mori 
( Yukuhiro et al., 2002) , (TAA), between tRNA?” 
and ND5 in Papilio maraho , etc. A poly-T stretch is 
found close to the srRNA, and this polythymidine 


stretch has been postulated to be a transcription 


lepidopteran 


control and/or the initiation of replication ( Zhang 
and Hweitt, 1997). 

From the available lepidopterean mitogenome 
data, it is easy to find that the longest intergenic 
spacer shows limited sequence conservation even 
related 
( Ostrinia furnicalis and O. nubilalis ( Coates et al., 
2005) ; Bombyx mori and B. mandarina (Pan et al., 
2008 ) ) Additionally, the 7 bp motif ATACTAA is 


relatively conserved in location across the order 


between closely lepidopteran species 


Lepidoptera, and to a lesser extent across insect 
groups ( Beard et al., 1993; Dotson and Beard, 
2001; Yukuhiro et al., 2002; Lee et al., 2006; 
Cameron et al., 2007; Kim et al., 2006, 2009; Ye 
et al., 2008; Pan et al., 2008; Hu et al., 2010; 
Feng et al., 2010; Liao et al., 2010). 
4.4 Phylogenetic analyses 

The phylogenetic 


relationships among the 


subfamilies Zerynthiinae, Parnassiinae, and 
Papilioninae, especially the phylogenetic position of 
the Zerynthiinae have long been controversial. Up to 
date, there were many different opinions about this 
problem as mentioned below: 1) Wu (2001) 
proposed that the Papilionidae should be divided into 
three subfamilies: Papilioninae, Parnassiinae and 
Zerynthiinae which is further divided into three 
genera: Bhutanitis ( Atkinson ), Luehdorfia 
(Cruger) and Sericinus; 2) Chou (1998) insisted 
that the parnassians should be lifted up as a family 
parallel to the family Papilionidae which is divided 
into two subfamilies of Papilioninae and 
Zerynthiinae; 3) Li and Zhu (1992) suggested that 
the parnassids should be categoried as the taxon of 
the familial level ( Parnassiidae ) parallel to the 
family Papilionidae, and the family Parnassiidae was 


devided into two subfamilies of Parnassiinae and 


Papilioninae; 4) whereas, some European and 


American experts are inclined to regard them either 
as the taxonomic level of tribes, that is the 
Parnassiini and Zerynthiini, or as the subfamilial 
level ( Parnassiinae and Zerynthiinae ) ( Munroe, 


1961; Hancock, 1983; Igarashi, 1984; Liu, 1999; 
Ackery et al., 1999). According to the results of 
this study on the grounds of mitogenomic data, it is 
reasonable to propose that the zerynthiids ( including 
S. montela and Luehdorfia chinensis) should be 
categorized as a taxon of tribal level ( the tribe 
Zerynthini ) which was commonly accepted by 
international scholars at present, and constitute the 


subfamily Parnassiinae together with the tribe 


Parnassiinii. 
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摘要 : 锯 风 蝶 类 与 凤 蝶 科 其 他 类 群 的 系统 发 生 关 系 及 其 分 类 学 地 位 一 直 存 在 争议 。 本 人 研究 采用 PCR 和 long PCR 
技术 测定 了 属于 锯 凤 蝶 类 的 丝带 风 蝶 Sericinus montelus 线粒体 基因 组 全 序列 ; 结合 已 有 的 其 他 凤 蝶 科 物 种 的 相应 
序列 数据 , 基于 13 个 蛋白质 编码 基因 重建 了 凤 蝶 科 主 要 类 群 的 系统 发 生 树 , 探讨 了 它们 之 间 的 系统 发 生 关 系 。 
基因 组 分 析 结 果 表 明 : 丝带 风 蝶 线粒体 基因 组 全 长 15 242 bp, 包括 13 个 编码 蛋白 基因 ( ATP6, ATPS, COT - M, 
ND1 -6, ND4L 和 Cytb)、22 个 tRNA 基因 、16S 和 12S rRNA 基因 以 及 非 编 码 的 控制 区 ; 基因 组 A, T, G 和 C 含量 
分 别 为 40.1% , 40.8% , 7.4% 和 11.7%, 表现 出 明显 的 AT 偏 位 。 所 有 的 蛋白 质 编 码 基 因 都 使 用 标准 的 起 始 密 码 
子 (ATN); 除 ND4 和 ND4L 基因 使 用 单个 的 T 作 为 终止 密码 子 外 , 其 余 和 蛋白 编码 基因 都 使 用 了 标准 的 终止 密码 子 
(TAA)。 除 丝氨酸 tRNA BS SUR H RRA, 所 有 tRNA 基因 都 形成 典型 的 三 叶 草 型 结构 。 基 因 组 中 共存 在 
12 个 大 小 介 于 2~65 bp 之 间 的 基因 间隔 区 以 及 15 个 大 小 介 于 1 ~8 bp 之 间 的 基因 重叠 区 , 其 中 , FEF COM A 
tRNA “之 间 的 24 bp 的 间隔 区 在 其 他 鳞 翅 目 昆 虫 中 未 曾 见 到 。 以 邻接 法 和 最 大 简约 法 并 基于 13 个 蛋白 质 编码 基 
序列 对 凤 蝶 科 进 行 了 系统 发 生 分 析 。 结 有 果 显 示 , Lir KRPE RARER Luehdorfia chinensis 先 构 成 一 个 文系 , 再 
和 冰 清 绢 蝶 Parnassius bremeri 构成 姊妹 群 ; 表明 锯 凤 蝶 类 应 作为 族 级 分 类 单元 归于 风 蝶 科 下 的 绢 蝶 亚 科 。 

关键 词 : EOSE KW; 丝带 凤 蝶 ; 线粒体 基因 组 ; 系统 发 育 分 析 
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